Benchmarking Distributed Stream Processing Platforms for IoT Applications
نویسندگان
چکیده
Internet of Things (IoT) is a technology paradigm where millions of sensors monitor, and help inform or manage, physical, environmental and human systems in real-time. The inherent closed-loop responsiveness and decision making of IoT applications makes them ideal candidates for using low latency and scalable stream processing platforms. Distributed Stream Processing Systems (DSPS) are becoming essential components of any IoT stack, but the efficacy and performance of contemporary DSPS have not been rigorously studied for IoT data streams and applications. Here, we develop a benchmark suite and performance metrics to evaluate DSPS for streaming IoT applications. The benchmark includes 13 common IoT tasks classified across various functional categories and forming micro-benchmarks, and two IoT applications for statistical summarization and predictive analytics that leverage various dataflow compositional features of DSPS. These are coupled with stream workloads sourced from real IoT observations from smart cities. We validate the IoT benchmark for the popular Apache Storm DSPS, and present empirical results.
منابع مشابه
RIoTBench: An IoT benchmark for distributed stream processing systems
The Internet of Things (IoT) is an emerging technology paradigm where millions of sensors and actuators help monitor and manage, physical, environmental and human systems in real-time. The inherent closedloop responsiveness and decision making of IoT applications make them ideal candidates for using low latency and scalable stream processing platforms. Distributed Stream Processing Systems (DSP...
متن کاملRethinking Cloud Computing For Big Data , Machine Learning , and IoT Workloads
Emerging applications such as large-scale distributed data processing, machine learning, and Internet of Things, are fueled by the computational resources provided by large scale distributed computing platforms. Cloud computing has emerged as a popular platform to run these emerging workloads. While early cloud platforms were used to mainly run web workloads, today it has become commonplace to ...
متن کاملCSA: Streaming Engine for Internet of Things
The next generation Internet will contain a multitude of geographically distributed, connected devices continuously generating data streams, and will require new data processing architectures that can handle the challenges of heterogeneity, distribution, latency and bandwidth. Stream query processing is natural technology for use in IOT applications, and embedding such processing in the network...
متن کاملBenchmarking Fast-Data Platforms for the Aadhaar Biometric Database
Aadhaar is the world’s largest biometric database with a billion records, being compiled as an identity platform to deliver social services to residents of India. Aadhaar processes streams of biometric data as residents are enrolled and updated. Besides ∼ 1 million enrollments and updates per day, up to 100 million daily biometric authentications are expected during delivery of various public s...
متن کاملSPOTLIGHT Principles for Engineering IoT Cloud Systems
ecently, we’ve seen a wide adoption and deployment of Internet of Things (IoT) infrastructures and systems for various crucial applications,1 such as logistics, smart cities,2 and healthcare. This has led to high demands on data storage, processing, and management services in cloud-based datacenters, engendering strong integration needs between IoT and cloud services. Cloud services are mature ...
متن کامل